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Abstract 

Distance oracles are data structures that provide fast (possibly approximate) answers to shortest- 
path and distance queries in graphs. The tradeoflf between the space requirements and the query time 
of distance oracles is of particular interest and the main focus of this paper. 

In FOCS'Ol, Thorup introduced approximate distance oracles for planar graphs. He proved that, for 
any e > and for any planar graph on n nodes, there exists a (1 + e)-approximate distance oracle using 
space 0{ne~^ logn) such that approximate distance queries can be answered in time 0(e^^). 

Ten years later, we give the first improvements on the space-query time tradeoff for planar graphs. 

• We give the first oracle having a space-time product with subquadratic dependency on 1/e. For 
space O(nlogn) we obtain query time 0(e~^) (assuming polynomial edge weights). We believe that 
the dependency on e may be almost optimal. 

• For the case of moderate edge weights (average bounded by poly{\ogn), which appears to be the case 
for many real- world road networks), we hit a "sweet spot," improving upon Thorup's oracle both in 
terms of e and n. Our oracle uses space 0(n log logn) and it has query time 0(e~^ -f- logloglogn). 

(Notation: O(-) hides low-degree polynomials in log(l/e) and log*(n).) 



1 Introduction 



Distance oracles [TZ05] generalize the all-pairs shortest paths problem as follows: instead of computing 
and storing a distance matrix (with quadratic space requirements and pairwise distance computations that 
require one table look-up only), we wish to compute a data structure that requires sub-quadratic space S but 
still allows for efficient (as in sublinear query time Q) distance computations. Depending on the application, 
it may be acceptable to output approximate answers to shortest-path and distance queries. The estimate 
provided by the distance oracle is supposed to be at least as large as the actual distance. The stretch a > 1 
of an approximate distance oracle is defined as the worst-case ratio over all pairs of nodes of the query result 
divided by the actual shortest-path length. 

Distance oracles can potentially be used in applications such as route planning and navigation [Gol07, 
Zar08, DSSW09], Geographic Information Systems (GIS) and intelligent transportation systems [JHR96], 
logistics, traffic simulations [ZKM97, BBJ+02, RN04, BG07], computer games [Sto99], server selection [NZ02, 
DCKM04, CCRK04, ST08, EBN09], XML indexing [STW04, STW05], reachability in object databases, 
packet routing [SS80], causal regulatory networks [CZE+11], and path finding in social networks [Kar29, 
Mil67, NewOl]. 

For general graphs, distance oracles use large amounts of space, or they have long query time, or their 
stretch is at least two [TZ05, SVY09, PRIO]. In this work we consider planar graphs, for which the known 
tradeoffs between stretch, space, and query time are much better (see Table 1 for an overview). 

One important reason for this better tradeoff performance is that algorithms can make use of small 
separators [Ung51, LT79, Mil86]. For approximate distance oracles, separators consisting itself of a small 
number of shortest paths are particularly useful [Tho04, Kle02]. Separator-based approaches however often 
use recursion of logarithmic depth, which manifests itself by logarithmic factors in either the space require- 
ment or the query time (or even both). Currently, the best tradeoff is provided by Thorup's approximate 
distance oracle, for which we provide a brief technical outline in Sections 2.1 and 2.2. The best exact distance 
oracles [FR06, Dji96, CXOO, Cab06, Nusll, MS12] have a space-query time product S ■ Q proportional to 
roughly ny^. For the best (1 -I- e)-approximate distance oracle [Tho04], that product S ■ Q is 0{n\ogn) for 
constant e. 
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Table 1: Time and space complexities of distance oracles for undirected planar graphs (some results extend 
to planar digraphs). Existing results are for arbitrarily weighted graphs; our new results are for graphs 
with polynomial weights (Theorem 1) and for moderately weighted graphs (Theorem 2) only. log*(n) is the 
iterated logarithm of n. For the bounds corresponding to our results, O(-) hides low-degree polynomials in 
log(l/e) and log*(n). 

Our main result is a distance oracle that improves upon these tradeoffs in terms of both n and e for graphs 
with moderate edge weights. By moderate we mean that, after normalization such that the smallest edge 
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weight is 1, the average weight is bounded by poly(\ogn). We beheve that this is a reasonable assumption 
as the average weight for the European road network (the version made available for scientific use by the 
company PTV AG) appears to be not too large. ^ We provide a (1 + e)-approximate distance oracle with 
the following characteristics: for constant e, the space is S* = 0(nloglog(n) log*(n)) and the query time 
is Q = O(logloglogn), thus S ■ Q = o(n log n). We show how the logarithmic dependency on n, which 
is quite common in separator-based approaches, can be avoided entirely, which may be interesting in its 
own right. ^ Equally important, our construction also yields the first space-time product with subquadratic 
dependency on e (known constructions have e^^ in both space and query [Tho04, Klc02] or in the query 
complexity [KKSll]). 

More precisely, we prove the following. All our bounds arc in the word RAM model [CR73]. 

Theorem 1. For any undirected planar graph G onn nodes with edge weights polynomial in n and for any e > 
there exists a {I +e) -approximate distance oracle with query time 0(e-i log^ (1/e) log log(l/e) log* (n)) using 
space 0{n [log(n) + log(l/e)] log* (n) log log(l/e)), and preprocessing time 0{ne~^ log^(n) log*(n) +SAfC{n) ■ 
logn), where SAfC{n) denotes the time required to compute a sparse neighborhood cover. 

Theorem 2. For any undirected planar graph G on n nodes with average weight < log^ n for some 
constant 9 > 0, and for any e > there exists a (1 + e) -approximate distance oracle with query time 
0{e~^ log^(l/e) loglog(l/e) log*(n)+logloglog(n)) using space 0{n [loglog(n) + log(l/e)] log*(n) loglog(l/e)), 
and preprocessing time 0(?ie^^ log^(rt) loglog(n) log*(n) + SJ\fC{n) ■ loglogri), where SMC{n) denotes the 
time required to compute a sparse neighborhood cover. 

As a starting point, we provide the following distance oracle with constant stretch. 

Proposition 1. For any undirected planar graph G on n nodes with average weight bounded by log^ n for 
some constant 9 >0 there exists an 0{1) -approximate distance oracle with query time O(logloglog?i) using 
space O{n\og\ogn) , and preprocessing time 0{n\o^ n + SAfC{n)-\og\ogn), where SI\fC{n) denotes the time 
required to compute a sparse neighborhood cover. 

Somewhat surprisingly, nothing more efficient than Thorup's (1 + e)~stretch oracle had been known for 
arbitrary constant approximations. The only 0(l)-stretch (as opposed to (l-l-e)-stretch) oracle constructions 
for planar graphs we are aware of are [Chc95, ACC+96, GKROl] and the following two indirect constructions: 
one construction is by using a routing scheme [FJ89, FJ90] and another construction is by using an (.ao~ 
embedding [KLMN04]. The space-query time tradeoff of our data structure is better than that of all the 
previous constructions. 

Techniques Our data structures make extensive use of Thorup's approximate distance oracle [Tho04] 
and of a large variety of techniques developed for planar graphs in recent years. We also introduce a new 
kind of r-divisions [Frc87] based on shortest-path separators. Other techniques we use include crossing 
substitutes [KS98], sparse neighborhood covers [BLT07], generalized dominating sets [KP98], and the Cycle 
MSSP data structure [MS12]. 

2 Preliminaries 

We use standard terminology from graph theory, see for example [Die05]. Graphs we consider are planar, 
undirected, unweighted (if not explicitly mentioned otherwise — the extension to moderately weighted graphs 

^The network covers European 14 countries and it has 18,010,173 nodes and 42,560,279 edges. It serves as an important 
benchmark graph for shortest-path query methods [DGJ08]. For the travel time metric (scc-eur2time.gr), the total weight is 
21,340,824,356, which yields an average of approximately 501. For the distance metric (scc-eur2dist.gr) the two values are 
9,420,195,951 (sum) and 221 (rounded average). Also, logj (42560279) 25.34 and thus (logj |E|)2 > 501. We do not claim to 
rigorously distinguish 0{poly(\og\E\)) from n{y/\E\) for this E (here ^42560279 ^ 6523.82). 

^Recently, the complexity of some optimization problems for planar graphs has been improved [WNlOb] from worst-case 
running time 0(n log n) to 0{n log log n) using fast r-divisions [Frc87] (and other techniques). For approximate distance oracles, 
a similar approach improves the space requirements at the cost of the query time [KKSll]. In this work, we improve the tradeoff, 
i.e. the product of space and query time. 



2 



is emphasized in the pseudocode), and they have n nodes. 

Let log*(-) denote the iterated logarithm function (also super-logarithm), which is defined as log* n = 
f + log* (log n) for n > 1 and as for n < 1. 

2.1 Planar Separators and r— divisions 

A separator for a graph G — {V, E) is a subset of the nodes 5 C 1^ such that removing the nodes in S from 
G partitions the graph into at least two disconnected components. Let us assign a weight w G [0, 1] to every 
node V E V. A separator is deemed balanced if none of the resulting components has weight more than a 
constant fraction p of the total weight for some constant p < 1. 

Planar graphs are known to have small separators: for any planar graph there exists a balanced separator 
consisting of 0{y/n) nodes [Ung51, LT79, Mil86]. Recursively separating a graph into smaller components, 
we obtain a division into edge- induced subgraphs. A node of G is a boundary node of the partition if it belongs 
to more than one subgraph. An r-division [Fre87] partitions G into 0{n/r) subgraphs, called regions, each 
consisting of 0{r) edges with at most 0{^/r) nodes on the boundary. 

Separators can be chosen, for example, to form a cycle [Mil86] or also to form a set of paths [LT79, Tho04]. 
Lipton and Tarjan [LT79] prove that, for any spanning tree T in a triangulated planar graph, there is a non- 
tree edge e such that the unique simple cycle in TU{e} is a balanced separator (fundamental cycle separator). 
Thorup [Tho04] uses this construction with a shortest-path tree T, rooted at an arbitrary source node. For 
any node u, let T{u) denote the tree path from u to the root. 

Lemma 1 (Shortest-Path Separability; Thorup [Tho04, Lemma 2.3]). In linear time, given an undirected 
planar graph H with a rooted spanning tree T and non-negative vertex weights, we can find three vertices u, 
V, and w such that each component of H \ V{T{u) U T{v) U T{w)) has at most half the weight of H . 

Since T is a shortest-path tree, a separator S consists of at most three shortest paths. 

2.2 Approximate Distance Oracle and Labeling Scheme 

The approximate distance oracle for planar graphs by Thorup [Tho04] can be distributed as a distance 
labeling scheme [PelOO, GKK+01, GPPR04]. Each node u is assigned a label C{u) such that there is a 
decoding function !?(•, •) that approximates the distance between u and v based on their labels only. 

Lemma 2 (Thorup [Tho04, Theorem 3.16 and Theorem 3.19]). There is an algorithm that computes (1 + e)- 
approximate distance labels for an n-node planar graph with the following properties. The algorithm runs in 
time 0{ne~'^\o^ n) and outputs labels of length O(e^^logn) with query time 0(e"^). 

One of the key ideas in Thorup's oracle is the concept of shortest-path tree separability as outlined in the 
previous section (Lemma 1). A constant number of shortest paths Q separate the graph into components 
of at most half the size. Another key idea is to approximate shortest s — t paths that intersect a separator 
path Q. Let the shortest s — t path intersect Q at a particular node q E Q. If we are willing to accept a 
slightly longer path, we can restrict the number of possible intersections from \Q\ to 0(l/e), for e > 0. We 
use the following variant of Klein and Subramanian [KS98] (which they also call a crossing substitute) . 

Lemma 3 (Klein and Subramanian [KS98, Lemma 4]). Let Q be a path of length 0{D) for an integer D. 
Let G C Q be a set of 0(1/ e) equally spaced nodes on Q, called the e-cover of Q. Any pair of nodes {u,v) 
at distance [D, 2D) whose shortest path intersects Q can be (1 -I- e)-approximated by a path going through a 
node c £ C , that is, 

d{u, v) < minc?(u, c) -|- d{c, v) < [1 -\- e)d{u, v). 



The proof is based on the observation that any detour using node c G C instead of node q could cause 
an additive error of at most 0{eD). 
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2.3 Distance— (5 Dominating Sets 

Let ^ < n be an integer. A S -dominating set of a graph G = (V, E) is a subset L CV of nodes such that for 
each V G V there is a node I G L at distance at most i5. It is well-known that there is a J-dominating set L 
of size at most \L\ < n/ {6 + 1) and that such a set L can be found efficiently [KP98]. 

A related but hard (even for degree-3 planar graphs) problem is the minimum p-center problem [PleSO, 
DF85, Ple87], where we want to compute a set f7 C F of size p = |C/| such that the maximum distance from 
any node w g to ?7 is minimized. We are mainly interested in that worst-case distance, for which the best 
guarantee is n/p. 

2.4 Sparse Neighborhood Covers 

Busch, LaFortune, and Tirthapura [BLT07] provide sparse covers [AP90, ABCP98, AGMW07, BLT07] for 
planar graphs. 

Lemma 4 (Busch, LaFortune, and Tirthapura [BLT07]). For any planar graph G and for any integer r, there 
is a sparse cover, which is a collection of connected subgraphs (Gi, 6*2, ... ); with the following properties: 

• for each node v there is at least one subgraph Gi that contains all neighbors within distance r, 

• each node v is contained in at most 30 subgraphs, and 

• each subgraph has radius at most p = 24r — 8 

(a graph has radius p if it contains a spanning tree of depth p). 

Furthermore, such a sparse cover can be computed in polynomial time. 

Note that, since each node is in at most 0(1) subgraphs Gi, the total size of the cover is 0(n). 

While for minor-free graphs the best-known construction algorithm requires polynomial time, their algo- 
rithm for planar graphs appears to actually run in time 0{n\ogn). Let SMC{n) denote the time required 
to compute sparse neighborhood covers. We state our preprocessing bounds with respect to SMC{n). 

2.5 Exact Distance Oracles and Cycle MSSP 

We use the Cycle Multiple-Source Shortest Paths (MSSP) data structure [MS12], which is a variant of Klein's 
MSSP data structure [Klc05]. 

Lemma 5 (Cycle MSSP [MS12, Theorem 4]). Given a directed planar graph G on n nodes and a simple 
cycle G with c — 0{\Jn) nodes, there is an algorithm that preprocesses G in O(nlog^ri) time to produce a 
data structure of size O(nloglogc) that can answer the following queries in 0(c log^ clog log c) time: for a 
query node u, output the distance from u to all the nodes of G . 

We also use an exact distance oracle for small subgraphs. 

Lemma 6 ([MS12, Theorem 3]). For any directed planar graph G with non-negative arc lengths, there is a 
data structure that supports exact distance queries in G with the following properties: the preprocessing time 
is 0(ri log(n) log log(7i)), the space required is O(nloglogrt), and the query time is 0{\/nlog^ (n) log\og{n)) . 



3 A More Compact Distance Oracle with Constant Stretch 

The overall structure of the distance oracle presented in this section is also reused for the (1 -(-e)-approxiniate 
distance oracle. The basic idea is to use Thorup's distance labels for long-range distances, which are the 
distances of length at least roughly logn, and to use sparse neighborhood covers (Section 2.4) for log log n 
different levels to approximate short-range distances. 
We prove Proposition 1, which we restate here. 
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Proposition 1. For any undirected planar graph G on n nodes with average weight bounded by log n for 
some constant 9 >0 there exists an 0{l) -approximate distance oracle with query time 0(logloglog?i) using 
space O(nloglogn), and preprocessing time 0{n\og'^ n + SAfC{n) -log logn), where SJ\fC{n) denotes the time 
required to compute a sparse neighborhood cover. 

3.1 Preprocessing Algorithm 

Let e = 0.5 (any constant e e (0, 1/2) works). The algorithm is described by the following pseudocode. 

preprocess G = (V, E) 

(i) Preparing for Long-range Queries 

compute a (5-dominating set L with 5 — [e~^ lognj as in [KP98] 

(for graphs with edge weights s.t. ^ w(e) < 0(nlog* n), set 5 = [e~^ log^^^ n\ instead 

and replace each edge e by w{e) edges before computing the (5-dominating set) 
for each node / g L 

compute Thorup's distance label [Tho04] (see Lemma 2) 
for each node v €V 

compute its nearest landmark node ly (the node ly ^ L that minimizes dciv^ly)) 
store {ly,dG{v, ly)) 

(ii) Preparing for Short-range Queries 
for every integer i > with 2' < [2e^^(5] 

compute a sparse neighborhood cover with radius r = 2' as in [BLT07] 
let = {GJ} denote this cover 
for each node v €V 

store the hst of graphs Gj G with v G V"(G* ) 

Space requirements Each distance label has size O(e^^logn) (see Lemma 2). The space requirement 
for the data structure computed in the first step is thus 0{n). In the second step, we iterate through 
0(log(e^^ logn)) = 0(loglog(n) +log(l/e)) levels (for weighted graphs with — 0{n\og^ n) for a 

constant 9 > Q we have 0{9\og\ogn + log(l/e)) levels). At each level i, for each node, we store a list of 
graphs L'^{v) C of constant length |L*(u)| = 0{1) [BLT07]. Over all levels, the space requirement is thus 
0(rilog(e~^ logn)). Here we assume that identifiers of length O(logn) bits can be stored using constant 
space, which is a common assumption in the word RAM model [CR73]. 

Preprocessing time The preprocessing time is dominated by the time required to compute the neighbor- 
hood covers SMC{n), which is bounded by a polynomial in the number of nodes n [BLT07]. The dominating 
set [KP98] , the distance labels [Tho04] , and the nearest landmarks [ErwOO] can be computed in almost linear 
time in n. 

3.2 Query Algorithm 

The algorithm is described by the following pseudocode, 
query {u, v) 

return the minimum of the long-range and the short-range query algorithm 

(i) Long-range Query 

return dG{u, ly) + dQ{lu, ly) + daily, v), where dci', ■) is the estimate obtained from the labels 

(ii) Short-range Query 

binary search for a level i such that 3Gj eG' ■.u,ve F(G;) and ^G}7^ G G'-^ : u,v e y(G}7^) 
return 2p2*, where p is the constant for the radius in Lemma 4 (here: p — 24) 
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Running time Computing the long-range result requires time 0(l/e) [Tho04]. For the short-range pairs, 
a binary search among 0(log(e~^ logn)) levels can be done in time 0(loglog(e~^ logn)). At each search level 
we need to compute the intersection of two sets of constant size (recall that each node is in at most 0(1) 
graphs Gj per level i [BLT07]). 

Stretch analysis For any pair of nodes at distance doiujv) > e^^S — e^^logn, the long-range 

algorithm returns a (1 + 6e) -approximation for daiu, v), since, using the triangle inequality, 

dGiu,v) < daiuAu) + dailu.lv) + dailv^v) 

< 6 + {l + e)dG{lu,lv) + S 

< S + {l + e){S + dG{u,v)+S) + S 

< {l + e)dG{u,v) + {A + 2e)S. 

For nodes at distance dG{u,v) < e~^S, the short-range algorithm returns a 4p-approximation (recall that p 
is the constant for the radius in [BLT07]). The graph G* with u,w e V{G]) at level i is a certificate that 
dG{u, v) < 2p2*. Since there is no graph G^7^ with u,v & F(G*7^) at level i — 1, the u-to-v distance satisfies 
dG{u,v)>T-\ 

We conclude this section by noting that, using Lemma 4, we have p = 24 and thus the stretch of this 
oracle is at most 96. The construction of Busch et al. [BLT07] works for general minor-free graphs. There 
may be a construction with smaller radius for planar graphs. 



4 (1 + e)— Approximate Distance Oracle 

We prove the main result (Theorem 2). The distance oracle presented in this section is based on the oracle 
with constant stretch as described in Section 3. 

4.1 Overview 

We first run the preprocessing algorithm of Section 3 with e being the actual value chosen by the application 
divided by 6. Note that the only distances approximated with stretch more than 1 + e are the ones in the 
range [1, log(n)]. This range of logarithmic size is then handled by data structures for log-logarithmically 
many levels. At level i, we are interested in distances in the range [2% 2*+^). 

The constant-stretch distance oracle uses a very crude estimate: if two nodes are contained in the same 
graph Gj with diameter 0(2*) but they are not together in a graph at level « — 1, they must be at distance 
0(2*) (0(2*) due to the diameter of GJ and r2(2') since the two nodes were not together in any graph at 
level z — 1). In the following, we provide an oracle that outputs a more precise estimate for pairs of nodes at 
distance 9(2*). 

We are aware of exact oracles for planar graphs that can answer bounded-length distance queries, which are 
distance queries for pairs at constant distance. For planar graphs, Kowalik and Kurowski [KK06] provide 
such a distance oracle that uses linear space (see [DKTIO] for an extension to sparse graphs). However, 
we cannot use their data structures, since the query time of their oracle is exponential in the length. In 
our oracle, short distances may be up to logarithmic in n. Another approach would be to use a distance 
oracle for planar graphs with bounded tree-width (see [MS12]): since diameter 8(2*) implies tree-width 
w = 0(2*) [EppOO, DII04], the query time can be made almost proportional to w. However, here we aim at 
query time almost proportional to 1/e instead. 

For the (1 -f e)-stretch oracle, we introduce an additional data structure for the level graphs G*. The 
difference to the oracle in Section 3 — one of our main technical contributions — is the following data 
structure that can answer approximate distance queries with additive stretch eA for planar graphs with 
diameter 0(A). 
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Theorem 3 (Additive-Stretch Approximate Distance Oracle). For any integer A, for any e > 0, and for any 

planar graph on n nodes with diameter C ■ A for any constant C > 1 there is an approximate distance oracle 

with additive stretch eA using space 0{Cn log log(l/ e) log* (n)) and query timeO{Ce~^ log^ ( 1/e) log log ( 1 /e) log* (n) ) . 

Furthermore, this distance oracle can he computed in time 0(Cne~^ log^(n) log* (n)). 

Let us emphasize that the oracle in Theorem 3 works for edge-weighted planar graphs as well. In the 
case of weighted graphs, we require that the weighted diameter (defined by the length of the longest shortest 
path) is bounded by C • A. 

4.2 High-level Construction: Preprocessing and Query Algorithms 

We now use the additive- stretch oracle as in Theorem 3 to prove Theorem 2, which we restate here. 

Theorem 2. For any undirected planar graph G on n nodes with average weight < log^ n for some 
constant > 0, and for any e > there exists a {1 + e) -approximate distance oracle with query time 
0{e^^ log^(l/e) loglog(l/e) log*(n)+logloglog(n)) using space 0{n [loglog(n) + log(l/e)] log*(?i) loglog(l/e)), 
and preprocessing time 0(ne^^ log^(n) loglog(n) log*(n) -f SJ\fC{n) ■ loglogn), where SMC{n) denotes the 
time required to compute a sparse neighborhood cover. 

Proof of Theorem 2. We describe the peprocessing and query algorithms. Since these algorithms are very 
similar to the algorithms in Section 3, we mainly highlight the differences. 

Preprocessing Algorithm We run the preprocessing algorithm of Section 3 with e being the actual 
value chosen by the application divided by a small constant (six suffices). For each level 2', for each graph 
Gj, we compute the additive-stretch oracle as in Theorem 3. The total space requirement per level is 
0{n [loglog(n) -I- log(l/e)] log*(n) loglog(l/e)). The preprocessing time again depends on the time SNC{n) 
required to compute the sparse neighborhood covers, since, for each level 2', we compute a sparse neighbor- 
hood cover and the additive-stretch oracle in each subgraph (Theorem 3). 

Query Algorithm We use the query algorithm of Section 3 with the following adaptation. After the 
binary search for the lowest level 2* with a graph G* that contains both u and v, we compute the result of 
the additive-stretch oracle for this level and also for the [log2(2p)] levels above (p again denotes the radius 
in Lemma 4) . The reason for this is that u and v may be in G* at level 2' but u and v may be at distance cD 
for some c G [1, 2p]. At a lower level, it is not guaranteed that G^ actually contains an approximate shortest 
path. Since p — 0(1), the query time is at most 0(e~^ log^(l/e) loglog(l/e) log*(n) -I- log log log (n)). □ 

4.3 Polynomial Weights 

The proof of Theorem 1 is based on the proof of Theorem 2. 

Proof of Theorem 1. There are two differences to the proof of Theorem 2. 

Preprocessing Algorithm We need to consider O(logn) levels instead of O(loglogn) levels, since a 
node may be at distance 0{poly{n)) from its nearest dominating node. The space complexity increases 
from 0{n [loglog(n) -|- log(l/e)] log*(n) loglog(l/e)) to 0{n [log(n) -I- log(l/e)] log*(n) loglog(l/e)). We also 
preprocess Thorup's distance oracle for e = 1/2 (any constant works). The space required for this is O(nlogn) 
and thus the overall space does not increase any further. 

Query Algorithm Instead of running a binary search to find the right level, we can now just query 
Thorup's distance oracle for constant e to identify the right level. Everything else remains the same. The 
query time is thus 0(e^^ log^(l/e) loglog(l/e) log*(n)). □ 
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5 Additive-Stretch Approximate Distance Oracle 



This section is devoted to the proof of Theorem 3. While there are distance oracles for special cases of planar 
graphs and special kinds of queries [DPZ95, DPZOO, DPZ91, CXOO, KK06, MS12], we unfortunately cannot 
use them for our purpose. 

Let us first restate Theorem 3. 

Theorem 3. For any integer A, for any e > 0, and for any planar graph on n nodes with diameter 
C ■ A for any constant C > 1 there is an approximate distance oracle with additive stretch eA using space 
0(Cn log log(l/e) log* (n)) and query time 0(Ce~^ log^(l/e) log log(l/e) log*(n)). Furthermore, this distance 
oracle can be computed in time 0(Cne^^ log'^(n) log* (n)). 

5.1 Overview 

We adapt Thorup's distance oracle as follows. During preprocessing, we recursively separate the planar 
graph G into at least two pieces each with at most half the weight (Lemma 1). Analogously to computing an 
r-division [Pre87], we separate G in a way such that both (i) the sizes of the resulting pieces are balanced and 
(a) the boundary dP of each piece P consists of at most a constant number of shortest paths (see [Tho04, 
Section 2.5.1] for a similar construction). We stop the separation as soon as subgraphs have logarithmic size. 
There are three types of shortest s — t paths we need to consider. 

1. Any shortest path between nodes in two different pieces must pass through a boundary path. 

2. For nodes in the same piece, the shortest path may leave through one boundary path and re-enter 
through another boundary path. 

3. For nodes in the same piece P, the shortest path may lie entirely within P. 
The third type of paths is handled by recursion. 

The first two types of paths are handled as follows. Instead of letting the s — t path pass through any node 
of to boundary, that is, any node on a separator path q G Q,we restrict the possible portals on the boundary 
to the e-cover for each path Q (which is chosen as a set of equally-spaced nodes C{Q) as in Lemma 3). Since 
C{Q) is an e-cover, the error we introduce is bounded by e • A. 

The savings in space can be obtained since (i) for each node we only need the distances to the 0{e^^) 
nodes in the covers on the boundary as opposed to the 0{e~^ logn) nodes on all separator paths as in [Tho04] 
and (ii) we can compute these distances at query time using the Cycle MSSP data structure (Lemma 5). 

5.2 Definitions 

Throughout, pieces are called P, P' , Pi, . . . and paths are called Q,Q' ,Qj, . . . . Let dP denote the set of 
separator paths on the boundary of a piece P. By #9P we denote the number of shortest paths on the 
boundary of a piece P. For a separator path Q, let C{Q) C Q denote the e-cover of Q, which is a set of 
0(l/e) equally-spaced nodes on Q. We call each node c G C{Q) a portal. For any piece P, let C{P) denote 
the set of portals on its boundary, C{P) := IJ C{Q). 

QGdP 

5.3 Preprocessing Algorithm 

We first compute a decomposition we call shortest-path s -division (similar to an r-division), by repeatedly 
using shortest-path separators (as in Lemma 1). The separator paths form a decomposition tree. Then, 
we compute an e-cover for each path. We further connect each portal node in an e-cover to all the portal 
nodes in covers on higher levels of the decomposition tree. Finally, we compute distances from nodes to their 
portals and store them implicitly using the Cycle MSSP data structure. 
Compared to [Tho04], the main technical differences are: 
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• selective storing of distances to portals 

we store distances to portals in e-covers only for a restricted set of nodes 

• global e-covers 

we compute one cover per path, as opposed to one cover per pair of path and node 
(to improve query time and space requirements) 

Our preprocessing algorithm is outlined in the following pseudocode. 

preprocess H — (V, E) 

let n = \V\ 

let the set of pieces V = {V} 

let the tree of boundary paths 5 = {} 

(*) beginning of partitioning {shortest-path division, similar to an r-division) 
While there is a piece P eV with #9P > 10 or size |P| > [e~^ logn] 
we compute a balanced separation with node weights as follows: 
If a piece has too many boundary paths, #9-P > 10 

assign weight 1 to each endpoint of any boundary path Q e dP 
assign weight to all the remaining nodes 
Else 

assign weight 1 to each node 
let T (u) , T (v) , T (w) be the three paths obtained by Lemma 1 applied to P, weighted as above 
add T{u), T{v), T{w) to the set of separator paths S 
remove P from V, partition P into pieces Pi, P2, ■ ■ ■ and add them to V 
(★) partitioning with 0(1) boundary paths per region computed 

(f ) inter-piece approximate distance oracle 
For Each separator path Q G S 

compute an e~cover C{Q) C Q (Lemma 3) 

For Each portal c G C{Q) 

compute and store the distances to all portals on ancestor paths Q' 
For Each piece P eV and separator path Q G dP 

augment the graph induced by the piece P with infinite edges such that C{Q) form the outer face 

compute the Cycle MSSP Data Structure for C{Q) and this augmented graph (as in Lemma 5) 
(f ) inter-piece oracle computed 

(I) recursive call 

For Each piece P eV 

If |P| > [e-^l 

recurse on the graph induced by P 

Else 

compute an exact distance oracle for P as in Lemma 6 



Claim 1 (Shortest-path separator r-division). At step (★) we have obtained a partitioning of the vertex set 
V into pieces Pi , P2 , ■ • • with the following properties: 

• each piece Pi has size \Pi \ — 0{e^^ logn), 

• the number of boundary paths ^dPi for each piece Pi is at most ten, and 

• the total number of pieces is at most 0{ne/ logn) . 
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Proof. Our construction is similar to that of an r-division [Fre87, WNlOb] but using shortest-path separators 
instead of cycle separators. It is also essentially the same as in [Tho04, Section 2.5.1]; see [Tho04, Fig. 4] 
for an illustration. Similar divisions have been used in [AGK+98, GKP95]. We recursively separate V into 
pieces using root-paths of a single shortest-path tree as in Lemma 1. 

Whenever a the boundary of a piece consists of more than 10 root-paths, we separate it using Lemma 1 
such that the boundary paths will be partitioned in a balanced way. Since we always use the same shortest- 
path tree T, we can use the following vertex weights: all the endpoints (leafs in T) of a previously selected 
root-path are weighted with one and all the remaining nodes are weighted with zero. 

The third property can be proven using the following observation. Let us track a piece P during the 
execution of the preprocessing algorithm. Since all the separator paths are taken from the same shortest-path 
tree T, the number of boundary paths #9-P increases only if P is the reason why Lemma 1 is invoked. A piece 
P' resulting from that invocation can have more than 10 boundary nodes by inheriting 10—1 paths from P 
and gaining 3 paths from Lemma 1. By one more invocation of Lemma 1, the number of boundary paths of 
P' can be reduced to less than 10. When the recursion stops at P', either P' or P has size 9(e~^ logn). □ 

Claim 2. The total space requirement per recursion level is at most 0(nloglog(l/e)). 

Proof. At each recursion level, for each portal in an e-cover c G Q, the above algorithm stores 0(e~^logn) 
portals and the corresponding distances, since, for each portal, we store the distances to all the portals on 
higher levels (as in [Tho04]). 

By Claim 1, the total number of pieces per level is at most 0(ne^/ log n) {sic!). Each piece is surrounded 
by at most 10 paths on each of which we have 0(l/e) portals. The total number of portals per level is thus 
at most 0{ne/ \ogn). Since each distance label requires space 0{e~^ logn), the total space per level for this 
step is 0{n). 

At each recursion level, for each node w e V^, the above algorithm stores a Cycle MSSP data structure for 
the portals on each boundary path (note that we can only do so since there is one fixed cover per boundary 
path as opposed to one cover per pair of node and boundary path). Since the cycle has 0(l/e) nodes, one 
Cycle MSSP data structure requires space 0(nloglog(l/e)) (see Lemma 5). By Claim 1, each piece has at 
most ten boundary paths. 

On the lowest level, we also store a distance oracle for a planar graph on 0(e^^) nodes. The overall space 
requirement for all these distance oracles is 0(n log log(l/e)) (Lemma 6). □ 

Note that this recursive algorithm reduces the graph size from n log n at each level. It follows directly 
that the recursion depth is at most 0(log* n). The data structure computed by the above algorithm thus 
occupies space 0(n log log(l/e) log* n). 

5.4 Query Algorithm 

The query algorithm, given a pair of nodes returns an estimate for dQ{u,v). The main differences to 
Thorup's distance oracle [Tho04] are: 

• two-way approximation 

instead of approximating the distance by ^^(m, g) -I- dQ{q,q') + dQ{q' ,v) for two portals q,q' on a 
separator path as in Thorup's distance oracle, we use the estimate da{u, Cu) + dcicu, q) for a portal Cu 
to approximate dG{u,q) and, analogously, for dciq^v) (by doing so, each node only needs to compute 
and encode distances to portals on its boundary paths as opposed to all logn levels) 

• Monge search for the optimal portal 

since we have only one e-cover per path, we can use the non-crossing property as in [AKM+87, KK90, 
FR06] to compute the two-way approximation 

we emphasize that there is also only one e~cover per path on higher levels (not just at the boundary 
of a piece) 
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Let us momentarily assume that the two query nodes u and v are in different pieces P„ and Py, respec- 
tively.^ Since we store distances from u to its cover C(P„) (and from v to C(P„)) during preprocessing, we 
may return the minimum 

min dGiu,Cu) + d{cu,Cy) + dcicv^v), 

where •) is a (1 + e)-approximation for dQ^-, •). We can compute d(c„, c^) for the two portals c„, since 
they have a lowest common ancestor in the separator decomposition tree consisting of at most three shortest 
paths, to whose e-covers we computed and stored the shortest-path distances during preprocessing. 

Claim 3. The above estimate is at most 6eA longer than dG(u,v). 

Proof. Any shortest path from u to u must intersect the boundary of both pieces. Our approximation uses 
at most one additional separator path (one of the paths in the least common ancestor separator). Due to 
Lemma 3, the additive distortion is at most 2eA per path. □ 

Per level i, we compute the minimum min dQ{u,Cu) + d(cu,Cy) + dG(cy,v). We know the 

Cuec{P^),c^ec{P^,) 

distance from u to the 0(l/e) portals c„ e C(P„) on its boundary paths. We also know the distance from 
these portals to the portals on the separator paths. We wish to efficiently compute the distance from u 
to the portals on the relevant separator paths to simultaneously compute all the relevant d(c„,Ct,). This 
computation can be done using the efficient Dijkstra implementation of [FR06]. 

Fakcharocnphol and Rao [FROG, Section 4.1 (and also Sec. 2.3)] devised an ingenious implementation of 
Dijkstra's algorithm [Dij59] that computes a shortest-path tree of a complete bipartite graph Hi U H2 in 
time 0{h\ogh) where h — \Hi\ + \H2\ (as opposed to 0{\Hi\ ■ |i?2|)) as long as the edge weights obey the 
Monge property (which essentially means that they correspond to distances in a planar graph wherein the 
nodes of Hi and H2 lie on a constant number of faces). We refer to this implementation as FR-Dijkstra. 

The query algorithm works as described in the following pseudocode. 

query(u, v) 

return the minimum among the results from all recursion levels: 
For Each recursion level 
let u G Pu and v & Py 

For Each pair of boundary paths (Qu,Qv) G dPu x dPy 

determine the three separator paths Qi,Q2,Q3 that separate Q„ from (as in [Tho04]) 
For Each Q € {Qi, (92, Q3} 

compute min d(u, q) for all q G Q simultaneously using FR-Dijkstra 

(analogously for v) 

compute min c?(u, q) + d{q, v) and keep it if it is the new minimum 

at the lowest recursion level 
If P„ = P„ Then 

query the exact distance oracle 

Claim 4. The query algorithm runs in time 0(e~^ log^(l/e) loglog(l/e) log*(n)). 

Proof. We compute the distances from u to the portals on the boundary path C{Qu) using the Cycle MSSP 
data structure in time 0(e~^ log^(l/e) loglog(l/e)) (Lemma 5). 

For each pair of boundary paths {Qm Qv) G 9P„ x dPy there is a constant number of separator paths Q 
on higher levels we need to consider (see constant query time oracle in [Tho04]). 

^Even if the two query nodes are in tlie same piece, it could still be that the shortest path passes through the separator 
(which consists of at most ten shortest paths). To account for these cases, we compute the minimum among paths passing 
through the separator (this computation is the same as if the nodes were in different pieces). Then we recurse until some 
termination criterion on the size is met. 
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Per separator path Q the FR-Dijkstra algorithm runs m time 0{{\C{Qu)\ + |C((5)|) log(|C((5M)| + 
|C'(Q)|)) = 0(e~^ log(l/e)). (Since we do not actuaUy need the on-Une version of the bipartite Monge 
search in [FR06], the log(l/e) factor can be ehminated by using [AKM+87, KK90] instead. However, we use 
FR-Dijkstra within the Cycle MSSP data structure, whose query time currently dominates the overall query 
time anyway.) 

This search is done for all the 0(log* n) levels of the recursion. On the lowest level, we also query the 
exact distance oracle in time 0(e~^ log^(l/e) loglog(l/e)) (Lemma 6). □ 



6 Conclusion 

Our (1 + e)-approximate distance oracle for planar graphs has a better space-time tradeoff — both in terms 
of n and e — than previous oracles. The improved tradeoff currently comes at a cost: the oracle cannot be 
distributed as a labeling scheme (it is however not clear whether o(log'^ n) bit approximate distance labels for 
planar graphs exist at all) and, for the improvement in terms of n, there is a dependency on the largest edge 
weights (Thorup's oracle depends on the largest integer weight for digraphs only). We believe that these 
sacrifices may be worthwhile since they allow us to achieve almost linear dependency on 1/e and remove 
the notorious logarithmic dependency on n, contributing to an important next step towards a linear-space 
approximate distance oracle with almost constant query time. 

An intermediate goal could be to find a linear-space distance oracle with arbitrary constant stretch and 
(almost) constant query time. For our construction, the overhead required to transform a constant-stretch 
oracle into a (1 -I- e)-stretch oracle is proportional to only log*(n) (and some overhead in log(l/e)). Although 
our construction of the (1 -I- e)-stretch oracle depends on the specific features of the constant-stretch oracle, 
given a better constant-stretch approximate distance oracle, it may be quite possible to improve our (1 -I- e)- 
stretch oracle as well. 
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